Model Selection

Continuous Action Space

# Continuous Action Space

Ppo LunarLander V2

This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to safely control lunar landings.

Ppo LunarLander V2

This is a reinforcement learning model based on the PPO algorithm, specifically designed to solve the landing task in the LunarLander-v2 environment.

Mlagents Crawler

This is a PPO agent model trained using the Unity ML-Agents library, specifically designed for reinforcement learning tasks in the Crawler environment.

Molecular Model

Ppo LunarLander V2

This is a reinforcement learning model based on the PPO algorithm, designed to solve control tasks in the LunarLander-v2 environment.

Ppo LunarLander V2

This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to control the safe landing of a lunar lander.

Sac Walker2d V3

This is a reinforcement learning model based on the SAC algorithm, specifically designed for the Walker2d-v3 environment to control bipedal robot walking.

Assignment2 Omar

This is a reinforcement learning model based on the PPO algorithm, specifically designed to solve the landing task in the LunarLander-v2 environment.

Classroom-workshop

This is a TD3 agent model trained using the stable-baselines3 library, specifically designed for reinforcement learning tasks in the Hopper-v3 environment.

This is a PPO reinforcement learning model trained based on the stable-baselines3 library, specifically designed for continuous control tasks in the Hopper-v3 environment.

Ppo HalfCheetah V3

This is a reinforcement learning model based on the PPO algorithm, specifically designed for the HalfCheetah-v3 environment and trained using the stable-baselines3 library.

Ppo MountainCar V0

This is a deep reinforcement learning model based on the PPO algorithm, specifically designed to solve control problems in the MountainCar-v0 environment.

Sac Pendulum V1

This is a reinforcement learning model based on the SAC algorithm, designed to solve control problems in the Pendulum-v1 environment.

PPO LunarLander V2

This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to safely control the lunar lander.

Dqn MountainCar V0

This is a DQN agent model trained using stable-baselines3, specifically designed to solve reinforcement learning tasks in the MountainCar-v0 environment.

Molecular Model

Ppo Pendulum V1

This is a reinforcement learning model based on the PPO algorithm, specifically designed to solve control problems in the Pendulum-v1 environment.

This is a reinforcement learning agent trained with the PPO algorithm, designed to control the balancing ball task in the Unity 3DBall game.

Decision Transformer Gym Hopper Medium

This is a decision transformer model trained on medium-performance trajectories in the Gym Hopper environment, suitable for continuous control tasks.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase